/Cffp Errors Corrected by the STIC Sterna Branch / , 

SerM Numbsr. Ol/fflf WA TP R C r^' ^^ l 

| — | Changed a file from non-ASCII to ASCII *— " C (J Verified by: / f\s Z (STIC slaff) 



| [ Changed the margins in cases where the sequence text was trapped* down to the next line. 
| | Edited a format error in the Current^Application Data section, specifically: 

v. 

I I Edited the Current Application Data section with the actual current number. The number inputted by the 
applicant was O the prior application data; or Q other . 

p~] Added the mandatory heading and subheadings for "Current Application Data". 

| | Edited the "Number of Sequences" field. The applicant spelled out a number instead of using an integer. 

|~] Changed the spelling of a mandatory field (the headings or subheadings), specifically: 

| | Corrected the SEQ ID NO when obviously incorrect. The sequence numbers that were edited were: 

[ | Inserted or corrected a nucleic number at the end of a nucleic line. SEQ ID HOs edited: 

[ | Corrected subheading placement. All responses must be on the same line as each subheading. If the 
applicant placed a response below the subheading, this was moved to its appropriate place. 

[ [ Inserted colons after headings/subheadings. Headings edited included: . 

| | Deleted extra, invalid, headings used by an applicant, specifically: 

Deleted: ion-ASCII "garbage" at the beginning/end of files; □ secretary initials/filename at end of file; 
□ page numbers throughout text; □ other invalid text such as . 

| | Inserted mandatory headings, specifically : 



Corrected an obvious error in the response, specifically: 



| 1 Edited idejotiftars where upper case is used but lower case is required, or vice versa. 

1~1 Corrected an error in the Number of Sequences field, specifically: 

| | A "Hard Page Break" code was inserted by the applicant All occurrences had to be deleted.. 

| | Deleted ending stop codon in amino acid sequences and adjusted the "(A)Length: - field accordingly (error 
due to a Patentln bug). Sequences corrected: 

□ Other . . 



applicant In the first Office 

3/1/95 



RAW SEQUENCE LISTING DATE: 02/19/2002 

PATENT APPLICATION: US/09/941, 947A TIME: 13:19:23 

Input Set : N:\Crf3\02102002\I941947A.raw 
Output Set: N:\CRF3\02192002\I941947A.raw 

1 <110> APPLICANT: Brzostowicz, Patricia C. 



2 Cheng, Qiong 

3 DiCosimo, Deana J. 

4 Koffas, Mattheos 

5 Miller, Edward S. Jr. 

6 Odom, J. Martin 

7 Picataggio, Steve 

8 Rouviere, Pierre E. 



9 <120> TITLE OF INVENTION: CAROTENOID PRODUCTION FROM A SINGLE CARBON SOURCE 

10 <130> FILE REFERENCE: CL1903 US NA 

C--> 11 <140> CURRENT APPLICATION NUMBER: US/09/941 , 947A 

12 <141> CURRENT FILING DATE: 2001-09*01 

13 <150> PRIOR APPLICATION NUMBER: 60/229,907 

14 <151> PRIOR FILING DATE: 2000-09-01 

15 <150> PRIOR APPLICATION NUMBER: 60/229,858 

16 <151> PRIOR FILING DATE: 2000-09*01 

17 <160> NUMBER OF SEQ ID NOS : 60 

18 <170> SOFTWARE: Microsoft Office 97 

20 <210> SEQ ID NO: 1 

21 <211> LENGTH: 1311 

22 <212> TYPE: DNA 

23 <213> ORGANISM: Methylomonas 16a 

24 <400> SEQUENCE: 1 



25 gatgtggtca catggcccta tcacttaacg gctgatattc gattttgtca ttggtttttt 60 

26 cttaacttta acttctacac gctcatgaac aaacctaaaa aagttgcaat actgacagca 120 

27 ggcggcttgg cgccttgttt gaattccgca atcggtagtt tgatcgaacg ttataccgaa 180 

28 atcgatccta gcatagaaat catttgctat cgcggcggtt ataaaggcct gttgctgggc 240 

29 gattcttatc cagtaacggc cgaagtgcgt aaaaaggcgg gtgttctgca acgttttggc 300 

30 ggttctgtga tcggcaacag ccgcgtcaaa ttgaccaatg tcaaagactg cgtgaaacgc 360 

31 ggtttggtca aagagggtga agatccgcaa aaagtcgcgg ctgatcaatt ggttaaggat 420 

32 ggtgtcgata ttctgcacac catcggcggc gatgatacca atacggcagc agcggatttg 480 

33 gcagcattcc tggccagaaa taattacgga ctgaccgtca ttggtttacc taaaaccgtc 540 

34 gataacgacg tatttccgat caagcaatca ctaggtgctt ggactgccgc cgagcaaggc 600 

35 gcgcgttatt tcatgaacgt ggtggccgaa aacaacgcca acccacgcat gctgatcgta 660 

36 cacgaagtga tgggccgtaa ctgcggctgg ctgaccgctg caaccgcgca ggaatatcgc 720 

37 aaattactgg accgtgccga gtggttgccg gaattgggtt tgactcgtga atcttatgaa 780 

38 gtgcacgcgg tattcgttcc ggaaatggcg atcgacctgg aagccgaagc caagcgcctg 840 

39 cgcgaagtga tggacaaagt cgattgcgtc aacatcttcg tttccgaagg tgccggcgtc 900 

40 gaagctatcg tcgcggaaat gcaggccaaa ggccaggaag tgccgcgcga tgcgttcggc 960 

41 cacatcaaac tggatgcggt caaccctggt aaatggttcg gcgagcaatt cgcgcagatg 1020 

42 ataggcgcgg aaaaaaccct ggtacaaaaa tcgggatact tcgcccgtgc ttctgcttcc 1080 

43 aacgttgacg acatgcgttt gatcaaatcg tgcgccgact tggcggtcga gtgcgcgttc 1140 

44 cgccgcgagt ctggcgtgat cggtcacgac gaagacaacg gcaacgtgtt gcgtgcgatc 1200 
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45 gagtttccgc gcatcaaggg cggcaaaccg ttcaatatcg acaccgactg gttcaatagc 1260 

46 atgttgagcg aaatcggcca gcctaaaggc ggtaaagtcg aagtcagcca c 1311 

48 <210> SEQ ID NO: 2 

49 <211> LENGTH: 437 

50 <212> TYPE: PRT 

51 <213> ORGANISM: Methylomonas 16a 



52 <400> SEQUENCE: 2 
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95 Phe Ala Gin Met lie Gly Ala Glu Lys Thr Leu Val Gin Lys Ser Gly 

96 340 345 350 

97 Tyr Phe Ala Arg Ala Ser Ala Ser Asn Val Asp Asp Met Arg Leu lie 

98 355 360 365 

99 Lys Ser Cys Ala Asp Leu Ala Val Glu Cys Ala Phe Arg Arg Glu Ser 

100 370 375 380 

101 Gly Val lie Gly His Asp Glu Asp Asn Gly Asn Val Leu Arg Ala lie 

102 385 390 395 400 

103 Glu Phe Pro Arg lie Lys Gly Gly Lys Pro Phe Asn lie Asp Thr Asp 

104 405 410 415 

105 Trp Phe Asn Ser Met Leu Ser Glu lie Gly Gin Pro Lys Gly Gly Lys 

106 420 425 430 

107 Val Glu Val Ser His 

108 435 



110 <210> SEQ ID NO: 3 

111 <211> LENGTH: 636 

112 <212> TYPE: DNA 

113 <213> ORGANISM: Methylomonas 16a 

114 <400> SEQUENCE: 3 



115 gaaaatacta tgtccgtcac catcaaagaa gtcatgacca cctcgcccgt tatgccggtc 60 

116 atggtcatca atcatctgga acatgccgtc cctctggctc gcgcgctagt cgacggtggc 120 

117 ttgaaagttt tggagatcac attgcgcacg ccggtggcac tggaatgtat ccgacgtatc 180 

118 aaagccgaag taccggacgc catcgtcggc gcgggcacca tcatcaaccc tcataccttg 240 

119 tatcaagcga ttgacgccgg tgcggaattc atcgtcagcc ccggcatcac cgaaaatcta 300 

120 ctcaacgaag cgctagcatc cggcgtgcct atcctgcccg gcgtcatcac acccagcgag 360 

121 gtcatgcgtt tattggaaaa aggcatcaat gcgatgaaat tctttccggc tgaagccgcc 420 

122 ggcggcatac cgatgctgaa atcccttggc ggccccttgc cgcaagtcac cttctgtccg 480 

123 accggcggcg tcaatcccaa aaacgcgccc gaatatctgg cattgaaaaa tgtcgcctgc 540 

124 gtcggcggct cctggatggc gccggccgat ctggtagatg ccgaagactg ggcggaaatc 600 

125 acgcggcggg cgagcgaggc cgcggcattg aaaaaa 636 



127 <210> SEQ ID NO: 4 

128 <211> LENGTH: 212 

129 <212> TYPE: PRT 

130 <213> ORGANISM: Methylomonas 16a 

131 <400> SEQUENCE: 4 
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14 6 Pro Gly Val lie Thr Pro Ser Glu Val Met Arg Leu Leu Glu Lys Gly 

147 115 120 125 

14 8 lie Asn Ala Met Lys Phe Phe Pro Ala Glu Ala Ala Gly Gly lie Pro 

149 130 135 140 

150 Met Leu Lys Ser Leu Gly Gly Pro Leu Pro Gin Val Thr Phe Cys Pro 

151 145 150 155 160 

152 Thr Gly Gly Val Asn Pro Lys Asn Ala Pro Glu Tyr Leu Ala Leu Lys 

153 165 170 175 

154 Asn Val Ala Cys Val Gly Gly Ser Trp Met Ala Pro Ala Asp Leu Val 

155 180 185 190 

156 Asp Ala Glu Asp Trp Ala Glu lie Thr Arg Arg Ala Ser Glu Ala Ala 

157 195 200 205 

158 Ala Leu Lys Lys 

159 210 



161 <210> SEQ ID NO: 5 

162 <211> LENGTH: 1860 

163 <212> TYPE: DNA 

164 <213> ORGANISM: Methylomonas 16a 

165 <400> SEQUENCE: 5 



166 atgaaactga ccaccgacta tcccttgctt aaaaacatcc acacgccggc ggacatacgc 60 

167 gcgctgtcca aggaccagct ccagcaactg gctgacgagg tgcgcggcta tctgacccac 120 

168 acggtcagca tttccggcgg ccattttgcg gccggcctcg gcaccgtgga actgaccgtg 180 

169 gccttgcatt atgtgttcaa tacccccgtc gatcagttgg tctgggacgt gggccatcag 240 

170 gcctatccgc acaagattct gaccggtcgc aaggagcgca tgccgaccat tcgcaccctg 300 

171 ggcggggtgt cagcctttcc ggcgcgggac gagagcgaat acgatgcctt cggcgtcggc 360 

172 cattccagca cctcgatcag cgcggcactg ggcatggcca ttgcgtcgca gctgcgcggc 420 

173 gaagacaaga agatggtagc catcatcggc gacggttcca tcaccggcgg catggcctat 480 

174 gaggcgatga atcatgccgg cgatgtgaat gccaacctgc tggtgatctt gaacgacaac 540 

175 gatatgtcga tctcgccgcc ggtcggggcg atgaacaatt atctgaccaa ggtgttgtcg 600 

176 agcaagtttt attcgtcggt gcgggaagag agcaagaaag ctctggccaa gatgccgtcg 660 

177 gtgtgggaac tggcgcgcaa gaccgaggaa cacgtgaagg gcatgatcgt gcccggtacc 720 

178 ttgttcgagg aattgggctt caattatttc ggcccgatcg acggccatga tgtcgagatg 780 

179 ctggtgtcga ccctggaaaa tctgaaggat ttgaccgggc cggtattcct gcatgtggtg 840 

180 accaagaagg gcaaaggcta tgcgccagcc gagaaagacc cgttggccta ccatggcgtg 900 

181 ccggctttcg atccgaccaa ggatttcctg cccaaggcgg cgccgtcgcc gcatccgacc 960 

182 tataccgagg tgttcggccg ctggctgtgc gacatggcgg ctcaagacga gcgcttgctg 1020 

183 ggcatcacgc cggcgatgcg cgaaggctct ggtttggtgg aattctcaca gaaatttccg 1080 

184 aatcgctatt tcgatgtcgc catcgccgag cagcatgcgg tgaccttggc cgccggccag 1140 

185 gcctgccagg gcgccaagcc ggtggtggcg atttattcca ccttcctgca acgcggttac 1200 

186 gatcagttga tccacgacgt ggccttgcag aacttagata tgctctttgc actggatcgt 1260 

187 gccggcttgg tcggcccgga tggaccgacc catgctggcg cctttgatta cagctacatg 1320 

188 cgctgtattc cgaacatgct gatcatggct ccagccgacg agaacgagtg caggcagatg 1380 

189 ctgaccaccg gcttccaaca ccatggcccg gcttcggtgc gctatccgcg cggcaaaggg 1440 

190 cccggggcgg caatcgatcc gaccctgacc gcgctggaga tcggcaaggc cgaagtcaga 1500 

191 caccacggca gccgcatcgc cattctggcc tggggcagca tggtcacgcc tgccgtcgaa 1560 

192 gccggcaagc agctgggcgc gacggtggtg aacatgcgtt tcgtcaagcc gttcgatcaa 1620 

193 gccttggtgc tggaattggc caggacgcac gatgtgttcg tcaccgtcga ggaaaacgtc 1680 

194 atcgccggcg gcgctggcag tgcgatcaac accttcctgc aggcgcagaa ggtgctgatg 1740 

195 ccggtctgca acatcggcct gcccgaccgc ttcgtcgagc aaggtagtcg cgaggaattg 1800 
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RAW SEQUENCE LISTING DATE: 02/10/2002 

PATENT APPLICATION: US/09/94 1 , 94 7A TIME: 13:19:55 

Input Set : A:\CL1903 US NA revised seq list .txt 
Output Set: N:\CRF3\02102002\I941947A.raw 

APPLICANT: Brzostowicz, Patricia C. 
Cheng, Qiong 

DiCosimo, Deana J. D06S Not Comply 

Koffas, Mattheos Corrected Diskette Needed 

Miller, Edward S. Jr. 
Odom, J. Martin 
Picataggio, Steve 
Rouviere, Pierre E. 

TITLE OF INVENTION: CAROTENOID PRODUCTION FROM A SINGLE CARBON SOURCE 
FILE REFERENCE: CL1903 US NA 

CURRENT APPLICATION NUMBER: US/09/94 1 , 94 7 A 
CURRENT FILING DATE: 2001-09-01 

PRIOR APPLICATION NUMBER: 60/229/907 
PRIOR FILING DATE: 2000-09-01 
PRIOR APPLICATION NUMBER: 60/229,858 
PRIOR FILING DATE: 2000-09-01 
NUMBER OF SEQ ID NOS : 60 
SOFTWARE: Microsoft Office 97 



ERRORED SEQUENCES 

3085 <210> SEQ ID NO: 60 

3086 <211> LENGTH: 19 

3087 <212> TYPE: DNA 

3088 <213> ORGANISM: Artificial Sequence 

3090 <220> FEATURE: 

3091 <223> OTHER INFORMATION: primer 

3093 <400> SEQUENCE: 60 

3094 tagctcgagt cacgcttgc 19 



3097, 
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VERIFICATION SUMMARY DATE: 02/10/2002 

PATENT APPLICATION: US/09/941, 947A TIME: 13:19:56 

Input Set : A:\CL1903 US NA revised seq list .txt 
Output Set: N:\CRF3\02102002\I941947A.raw 

L:21 M:270 C: Current Application Number differs, Replaced Current Application No 
L:21 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:3097 M:254 E: No. of Bases conflict, LENGTH : Input : 52 Counted: 19 SEQ: 60 
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